51(1.84 

I  i6r 

no. 997-1005 
1979-80 
lncompl . 
copy  2 


CENTRAL  CIRCULATION  BOOKSTACKS 

J™J£?°?  cl}arS'lnS  this  material  is  re- 

thTuu  f°i"  ltS  tenewal  or  ^s  return  to 
the  library  from  which  it  was  borrowed 
on  or  before  the  Latest  Date  stamneH 

fetors  SS7  ^  ChZ?ed  *  mSm 
Tee  ot  $75.00  for  each  lost  book 

the  University.  ^  '  ,n  dis,"'"«>l   from 

TO  RENEW  CAU  TELEPHONE  CENTER,  333-8400 


^TrSiyph^writen-due<fate^ 


/7<?.  ^y^  UIUCDCS-R- 79-999 


UILU-ENG  79   1747 


0*7^?- 


*♦• 


The   Schorr-Waite  Marking  Algorithm  Revisited 

by 
Nachum  Dershowitz 

November   19  79 


IHE  LIBRARY  OF  THE 

MAY  'i  s  i9ou 

UNIVERSITY  OF  ILLINOIS 
"RBANA-CHAMPAIGN 


«*M*r 


The  person  charging  this  material  is  re- 
sponsible for  its  return  to  the  library  from 
which  it  was  withdrawn  on  or  before  the 
Latest  Date  stamped  below. 

Theft,    mutilation,    end    underlining    of    booki    or*    reasons 
for    disciplinary    action    and    may    result    in    dismissal    from 
the  University. 
To  renew  call  Telephone  Center,  333-8400 

UNIVERSITY    OF    ILLINOIS    LIBRARY    AT    URBANA-CHAMPAIGN 


The   Schorr-Waite  Marking  Algorithm  Revisited* 


by 


Nachum  Dershowitz 


November,  1979 


*This  research  was  partially  supported  by  the  National  Science  Foundation 
under  grants  MCS-77-22830  and  MCS-79-04897. 


Digitized  by  the  Internet  Archive 

in  2013 


http://archive.org/details/schorrwaitemarki999ders 


Introduction 

There  has  been  a  recent  wave  of  interest  (Duncan  and  Yelowitz 
[1979],  Gerhart  [1979],  Gries  [1979],  Topor  [1979])  In  proving  the  correct- 
ness of  Schorr  and  Waite's  [1967]  algorithm  for  marking  all  the  nodes  in  a 
directed  graph  that  are  accessible  from  a  given  node  (see  also  Knuth  [1968] , 
Section  2.3.5).  In  this  paper  we  show  how,  by  developing  that  algorithm  in  a 
systematic  manner  (along  the  lines  of  Knuth  [1974])  from  the  standard 
depth-first  search  paradigm  (see,  for  example,  Tarjan  [1972]),  a  simple 
proof  of  correctness  follows  naturally. 

First  Version 

Given  a  finite  directed  (not  necessarily  strongly  connected)  graph 

G,   the  goal  is  to  set  an  "active"  bit  a(v)  to  true  for  those  nodes  v  in  G 

that  are  accessible  from  some  given  node  p.   It  is  assumed  that  initially 

a(v)   is  false  for  all  veG.   Each  node  v  has  a  given  outdegree  d(v)  with 

pointers  G  (v) ,G0(v) , . . . ,G,  ,  N  (v)  to  neighboring  nodes, 
i     l  d(v) 

The  following  recursive  program  accomplishes  that  task  by  perform- 
ing a  depth-first  search  of  the  graph: 

call  Mark(p) 
procedure  Mark(v): 
if  ~a(v) 

then  a(v)  :■  true 

for  c  :■  1  to  d(v) 


do  call  Mark(G  (v) ) 
c 


od 
fi  . 
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The  program  marks  the  nodes  accessible  from  p  by  first  marking  p  and   then 
recursively  calling  the  marking  program  for  each  of  the  neighbors  of  p. 

To  prove  that  this  recursive  program  is  correct,  four  things  must 
be  shown: 

a)  executing  the  program  leaves  the  structure  of 
the  graph  G  unchanged; 

b)  only  nodes  accessible  from  p  are  activated; 

c)  the  program  terminates; 

d)  upon  termination,  all  nodes  accessible  from  p  are  active. 
That  (a)  G  is  unchanged  is  obvious  from  the  fact  that  the  program 

contains  no  assignment  to  pointers  in  G.   By  induction  on  the  computation  it 

can  be  seen  that  (b)  Mark  is  only  called  for  accessible  nodes:  The  argument 

p   of  the  initial  call  is  accessible.   Furthermore,  assuming  that  Mark(v)  is 

called  for  an  accessible  node  v,  then  the  calls  Mark(G  (v) )  are  to  the  like- 

c 

wise  accessible  neighbors  of  v. 

The  termination  property  (c)  is  proved  by  structural  induction 
(Burstall  [1969])  on  the  number  of  unactivated  nodes  in  G:  Since  in  execut- 
ing Mark(v)  one  previously  inactive  node  in  G,  viz.  v,  is  activated  before 
calling  Mark(G  (v)),  the  inductive  hypothesis  asserts  that  each  such  call  to 
Mark  for  a  neighbor  of  v  will  terminate.  Since  Mark(v)  calls  Mark(G  (v) ) 
only  a  finite  number  of  times,  Mark(v)  must  terminate  as  well. 

Lastly,  the  following  line  of  reasoning  demonstrates  that  (d)  all 
accessible  nodes  are  activated:  Clearly,  the  program  never  deactivates  a 
node.  Thus,  after  any  execution  of  Mark(v),  v  must  be  active.  Now,  since 
all  nodes  are  initially  inactive,  if  upon  termination  a(v)  holds  for  any 
node  v,  then  it  must  be  on  account  of  the  assignment  a(v)   !■  true,   after 
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which  the  for-loop  was  executed  and  consequently  a(G  (v) )  must  also  hold  for 
all  the  neighbors  G  (v) ,  1  <  c  <  d(v),  of  v.  In  particular,  a(p)  holds  upon 
termination  of  Mark(p);  it  follows  (by  induction  on  the  length  of  an  access 
path)  that  a(v)  holds  for  all  nodes  v  accessible  from  p. 

Intermediate  Version 

As  is  well-known,  the  underlying  idea  of  the  Schorr-Waite  algo- 
rithm is  to  stack  the  arguments  of  the  recursive  calls  within  the  graph  G 
itself.  The  value  of  the  loop  counter  c  for  a  call  Mark(v)  is  stored  in  a 
field  c(v)  that  can  take  on  values  from  1  to  d(v)  +  1.  (These  c(v)'s  could 
be  placed  on  a  stack,  or  stored  within  the  nodes  v.)  Let  u  point  to  the  head 
of  the  stack  of  arguments;  the  continuation  of  the  stack  from  a  node  v  is 
stored  in  the  c(v)-th  pointer  from  v. 

We  are  led  to  the  program: 

(u,v)  :=  (null.p) 
call  Mark 
procedure  Mark: 
if  ~a(v) 

then  a(v)  :=  true 

for  c(v)  ;»  1  to  d(v) 

do  (v,u,Gc^  (v))  :-  (Gc^j  (v),v,u) 
call  Mark 


(Gc(u)  (u),v,u)  :'  (v,u,Gc(u)  (u)) 
od 


fi  . 


What  were  implicit  assignments  v  :»  p  and  v  :■  G  (v)  in  the  calls  of  the 
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first  version  are  now  explicit.  The  stack  u  is  set  initially  to  some  unique 
"null"  pointer  (actually  the  initial  value  of  u  does  not  affect  the  computa- 
tion); before  the  recursive  call,   the  node  v  is  added  to  the  top  of  the 

stack  u  and  the  c-th  pointer  from  v,  G  .    .  (v) ,  is  made  to  point  to  what  was 

c  (v)  r 

the  top  of  the  stack. 

The  proof  of  the  correctness  of  this  program  is  the  same  as  that 
of  the  previous  version  except  that  to  prove  that  the  structure  is  un- 
changed, that  hypothesis  is  combined  with  termination  in  the  inductive  argu- 
ment: One  may  assume  that  the  recursive  call  to  Mark  returns  with  the  values 
of  u,  v,  and  G  as  they  were  before  the  call.  Also,  the  counter  c(v)  is 
unaffected  by  the  recursive  call,  since  Mark  can  never  be  called  again  with 
v  inactive.  Thus,  one  need  only  verify  that  the  effect  of  the  multiple  as- 
signment 

(v,U,Gc(v)(v))  :=  (Gc(v)(v),V,u) 
is  undone  by  the  subsequent  assignment 

<GC(U )  (u),v,u)  :=  (v,u,Gc(u)  (u)). 

But  that  is  easy  to  see,  e.g.  the  original  value  of  u  was  in  fact  stored  in 

what  was  G  .  .  (v)  and  is  now  G  ,  .  (u) . 
c(v)  c(u) 

Final  Version 

In  this  section  we  derive  a  somewhat  more  efficient  marking  algo- 
rithm from  the  previous  version  by  a  series  of  program  transformations. 
Each  transformation  is  "correctness-preserving"  in  that  the  transformed  pro- 
gram is  guaranteed  to  be  correct  if  the  untransformed  program  was.  (See, 
for  example,  Gerhart  [1975]  for  methods  of  demonstrating  this  property.) 

As  a  first  step,  note  that  in  the  event  that  Mark  is  called  for  an 
active  node,   that  call  is  essentially  vacuous  and  all  the  stacking  before 
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the  call  and  unstacking  after  are  superfluous.   This  suggests  separating  out 
that  case  by  replacing  the  body  of  the  for-loop  with 


C  :"Gc(v)(v) 
If  ~a(t) 

then  (v,u,Gc ,  .  (v) )  :=  (t,v,u) 

call  Mark 


(Gc(u)(u),V,u)  :=  (v»u»Gc(u)(u)) 
fi  . 


With  this  change,  Mark  can  never  be  called  for  an  active  node.  Therefore, 
that  test  at  the  beginning  of  Mark  may  be  eliminated.  (Similarly,  if  the 
node  has  no  outgoing  edges,  then  the  call  does  no  more  than  set  the  active 
bit,  and  that  case,  too,  could  have  been  separated  out.) 

Finally,  one  may  remove  the  recursion  from  the  program  and  replace 
it  with  two  goto's  in  the  standard  manner.  To  detect  the  return  from  the 
outermost  call,  one  need  only  test  for  a  null  stack.  Thus,  our  final  mark- 
ing algorithm  is: 
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(u,v)  :=  (null.p) 
Mark: 

a(v)  :=  true 

for  c(v)  :=  1  to  d(v) 


do  t  :-Gc(v)(v) 


If  ~a(t) 

then  (v,u,G     (v))  :=  (t,v,u) 
goto  Mark 
cont'd: 

(Gc(u)(u)'V'u)  :=  (v»U'Gc(u)(u)) 

f_l 

od 
if  u  *   null  then  goto  cont'd  _fi  . 

This  "final"  version  can  be  further  modified  according  to  one's 
particular  needs  and  tastes.  It  can,  for  example,  be  adapted  to  the  case 
where  all  nodes  have  outdegree  zero  or  two  (as  in  LISP)  by  unwinding  the 
f or-loop. 

Conclusion 

We  have  shown  how  the  sophisticated  Schorr-Waite  marking  algorithm 
may  be  developed  from  a  straightforward  depth-first  marking  algorithm  by  a 
short  sequence  of  relatively  simple  modifications.  Other  (more  or  less  de- 
tailed) derivations  of  this  or  related  algorithms  are  pursued  in  Soule 
[1977],  Griffiths  [1979],  and  Lee,  Gerhart,  and  deRoever  [1979];  some  of 
those  references  go  on  to  develop  more  involved  algorithms. 
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We  have  also  seen  how  the  correctness  of  the  last  version  follows 
without  difficulty  from  the  (virtually  obvious)  correctness  of  the  original 
version.  Developing  the  program  in  steps  has  made  it  easier,  in  our  view, 
to  understand  and  validate  the  final  result. 
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